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Abstract —We consider the downlink of Cell-Free Massive 
MIMO systems, where a very large number of distributed access 
points (APs) simultaneously serve a much smaller number of 
users. Each AP uses local channel estimates obtained from 
received uplink pilots and applies conjugate beamforming to 
transmit data to the users. We derive a closed-form expression 
for the achievable rate. This expression enables us to design an 
optimal max-min power control scheme that gives equal quality 
of service to all users. 

We further compare the performance of the Cell-Free Massive 
MIMO system to that of a conventional small-cell network and 
show that the throughput of the Cell-Free system is much more 
concentrated around its median compared to that of the small¬ 
cell system. The Cell-Free Massive MIMO system can provide 
an almost 20— fold increase in 95%-likely per-user throughput, 
compared with the small-cell system. Furthermore, Cell-Free 
systems are more robust to shadow fading correlation than small¬ 
cell systems. 

I. Introduction 

In Massive MIMO, large collocated or distributed antenna 
arrays are deployed at wireless base stations [1], Collocated 
Massive MIMO architectures, where all service antennas are 
located in a compact area, have the advantage that the backhaul 
requirements are low. By contrast, in distributed Massive 
MIMO, the service antennas are spread out over a wide 
area. Owing to its ability to fully exploit macro-diversity 
and differences in path loss, distributed Massive MIMO can 
potentially offer a very high probability of coverage. At the 
same time, it can offer all the advantages of collocated Massive 
MIMO. Despite its potential, however, fairly little work on 
distributed Massive MIMO is available in the literature. Some 
comparisons between distributed and collocated systems were 
performed in [2] for the uplink and in [3], [4] for the downlink. 
In [3], [4], perfect channel state information was assumed to 
be available both at the base station and at the users. 

On a parallel avenue, small-cell systems, where the base 
stations do not cooperate coherently, are often viewed as en¬ 
ablers for high spectral efficiency and coverage [5]. However, a 
comparison between small-cell and distributed Massive MIMO 
is not yet available. 

Contribution of the paper: We consider a distributed Mas¬ 
sive MIMO system where a large number of service antennas, 
called access points (APs), and a much smaller number of 
autonomous users are distributed at random over a wide area. 
All APs cooperate via a backhaul network, and serve all users 
in the same time-frequency resource via time-division duplex 
(TDD) operation. There are no cells or cell boundaries. Hence, 
we call this system “Cell-Free Massive MIMO”. Cell-Free 


Massive MIMO is expected to offer many advantages: 1) a 
huge throughput, coverage probability and energy efficiency; 
and 2) averaging out of small-scale fading and uncorrelated 
noise so that the performance depends only on the large-scale 
fading. In this paper, we restrict the discussion to the downlink 
of Cell-Free systems with conjugate beamforming. We assume 
that the channels are estimated through uplink pilots. The 
paper makes the following specific contributions: 

« We derive a closed-form expression for the achievable 
rate with a finite number of APs, taking into account 
channel estimation errors, power control, and the non¬ 
orthogonality of pilot sequences. In [6], corresponding 
capacity bounds were derived assuming pilot sequences 
are orthogonal. 

• We design two pilot assignment schemes: random pilot 
assignment and greedy pilot assignment. We show that 
the greedy pilot assignment is better. 

« We propose a max-min power control algorithm that 
maximizes the smallest of all user rates, under a per- 
AP power constraint. This power control problem is 
formulated as a quasi-convex optimization problem. 

« We compare the performance of Cell-Free Massive 
MIMO and conventional small-cell systems under uncor¬ 
related and correlated shadowing models. Conclusions of 
this comparison are given in Section VI. 

II. System Model 

We consider a network of M APs that serve K users in the 
same time-frequency resource. All APs and users are equipped 
with a single antenna, and they are randomly located in a large 
area. In this work, we consider the downlink with conjugate 
beamforming. We consider only the conjugate beamforming 
technique since it is simple and can be implemented in a 
distributed manner. The transmission from the APs to the users 
proceed by TDD operation, including two phases for each 
coherence interval: uplink training and downlink payload data 
transmission. In the uplink training phase, the users send pilot 
sequences to the APs and each AP estimates its own CSI. 
This is performed in a decentralized fashion. The so-obtained 
channel estimates are used to precode the transmit signals. 

A. Uplink Training 

Let T be the length of the coherence interval (in samples), 
and let r be the length of uplink training duration (in samples) 
per coherence interval. It is required that r < T. During 
the training phase, all K users simultaneously send pilot 


sequences of length r samples to the APs. Let tp k £ C TXl , 
where ||<£fc|| 2 = 1, be the pilot sequence used by the fcth user. 
Then, the r x 1 pilot vector received at the mth AP is 

K 

y m = v^E QmktPk A W m , (1) 

fc=1 


where p p is the normalized transmit signal-to-noise ratio 
(SNR) of each pilot symbol, w m is a vector of additive noise 
at the mth AP—whose elements are i.i.d. CAT ( 0,1) RVs, and 
g m k denotes the channel coefficient between the fcth user and 
the mth AP The channel g m k is modeled as follows: 


gmk — Pmk h m k , 


( 2 ) 


where h m k represents the small-scale fading, and B jr ,k repre¬ 
sents the path loss and large-scale fading. We assume that h m k, 
m = 1, .. •, M , k = 1,... AT, are i.i.d. CAf (0,1) RVs. The 
assumption of independent small-scale fading is reasonable 
since the APs and the users are distributed over a wide area, 
and hence, the set of scatterers is likely to be different for 
each AP and each user. 

Based on the received pilot signal y m , the mth AP will 
estimate the channel .g mfc ,Vfc = 1 Let y mk = y>fy m 

be the projection of y m on ipjB: 

K 

Vmk = \/ T PpPmk B~ y/ T Pl> ^ ' Qmk'^Pk Pk' T < Pk w m- (3) 

k'^k 

Although, for arbitrary pilot sequences, y m k is not a sufficient 
statistic for the estimation of g mk , one can still use y m k to 
obtain suboptimal estimates. In the case where any pair of 
pilot sequences is either fully correlated or exactly orthogonal, 
then y m k is a sufficient statistic, and estimates based on y m k 
are optimal. The MMSE estimate of g m k given y m k is 

„ WTppfimkljmk 

g m k =-——-——-—p- (4) 

T Pp 1 Pmk' |‘Pk Vk' | 4” 1 

Remark 1: If r > K, then we can choose <pi, ■ ■ ■ so 
that they are pairwisely orthogonal, and hence, the second term 
in (3) disappears. However, owing to the limited length of the 
coherence interval, in general, r < K, and non-orthogonal 
pilot sequences must be used by different users. The channel 
estimate g m k is degraded by pilot signals transmitted from 
other users (the second term in (3)). This causes the so- 
called pilot contamination effect. There is still room for a 
considerable number of orthogonal pilots. For the case of 
pedestrians with mobility less than 3 km/h, at a 2 GHz carrier 
frequency the product of the coherence bandwidth and the 
coherence time is equal to 17, 640 (assuming a delay spread 
of 4.76 ps) [7], so if half of the coherence time is utilized 
for pilots then there is an available pool of 8820 mutually 
orthogonal pilots. 


B. Downlink Payload Data Transmission 

The APs treat the channel estimates as the true channels, and 
use conjugate beamforming to transmit signals to the K users. 
Let Sk, k = 1,, K, where E {|s/c| 2 } = 1, be the symbol 


intended for the fcth user. With conjugate beamforming, the 
transmitted signal from the mth AP is 


K 



fc=l 


where g mk , m = 1,..., M, k = 1,... K, are power control 
coefficients chosen to satisfy the following average power 
constraint at each AP: 

E {\x m \ 2 } < Pd- (6) 

With the channel model in (2), the power constraint 
E {|x m | 2 } < pd can be rewritten as: 

K 

^ ^ Pmk^imk E 1? (7) 

k= 1 


where 



The received signal at the kth user is given by 


M M K 

'T'k— 'y ^ 9mk%m Trifc = \/ Pd ^ ^ ^ J ^rnk' 9mk9mk' $k' (9) 

m= 1 m=lk'=1 

where n k represents additive Gaussian noise at the fcth user. 
We assume that n k ~ CA/”(0,1). Then s k will be detected 
from r/c. 

III. Achievable Rate 

In this section, we derive a closed-form expression for the 
achievable rate, using similar analysis techniques as in [8]. We 
assume that each user has knowledge of the channel statistics 
but not of the channel realizations. The received signal r k in 
(9) can be rewritten as 

Tfc=DS/ L . • Sk + ENfc, (10) 

where DS fc = v'Pd® {£m=t Vmk9mkg^ is a deterministic 
factor that scales the desired signal, and EN/, : is an “effective 
noise” term which equals the RHS of (9) minus DS^ • s k - A 
straightforward calculation shows that the effective noise and 
the desired signal are uncorrelated. Therefore, by using the fact 
that uncorrelated Gaussian noise represents the worst case, we 
obtain the following achievable rate of the fcth user for Cell- 
Free (cf) operation: 

(11) 

We next provide a new exact closed-form expression for the 
achievable rate given by (11), for a finite number of APs. (The 
proof uses a standard Gaussian-is-worst-case-noise argument 
and is omitted here due to space limitations.) 

Theorem 1: An achievable rate of the transmission from the 
APs to the fcth user is given by (12), shown at the top of the 
next page. 
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IV. Pilot Assignment and Power Control 

In this section, we will present methods to assign the pilot 
sequences to the users, and to control the power. Note that 
pilot assignment and power control are decoupled problems 
because the pilots are not power controlled. 

A. Pilot Assignment 

As discussed in Remark 1, non-orthogonal pilot sequences 
must be used by different users due to the limited length of 
the coherence interval. The main question is how the pilot 
sequences should be assigned to the K users. In the following, 
we present two simple pilot assignment schemes. 

1) Random Pilot Assignment: Since the length of the pilot 
sequences is r, we can have r orthogonal pilot sequences. 
With random pilot assignment, each user will be randomly 
assigned one pilot sequence from the set of r orthogonal pilot 
sequences. 

This random pilot assignment method is simple. However, 
with high probability, two users that are in close vicinity of 
each other will use the same pilot sequence. As a result, the 
performance of these users is not good, due to the high pilot 
contamination effect. 

2) Greedy Pilot Assignment: Next we propose a simple 
greedy pilot assignment method, that addresses the short¬ 
comings of random pilot assignment. With greedy pilot as¬ 
signment, the K users are first randomly assigned K pilot 
sequences. Then the worst-user (with lowest rate), say user k, 
updates its pilot sequence so that its pilot contamination effect 
is minimized. The algorithms then proceeds iteratively. Note 
that the greedy pilot assignment can be performed at a central 
processing unit. 

The pilot contamination effect at the fcth user is re¬ 
flected by the second term in (3) which has variance 
Ylk'=£kfimk' | < / 5 a|Va;'| ■ So, the worst-user (the fcth user) will 
find a new pilot sequence which minimizes the pilot contam¬ 
ination, summed over all APs: 

M K 

argmin Y] /W \<Pk<Pk'\ ■ (13) 

' Pk i 

m —1 k'^tk 

Since ||<Pfc|| 2 = 1, the expression in (13) is a Rayleigh quotient, 
and hence, the updated pilot sequence of the fcth user is the 
eigenvector which corresponds to the smallest eigenvalue of 
the matrix Em=i E k'jtk Pmk'Vk'Vk'- 

B. Power Control 

In our work, we focus on max-min power control. Specif¬ 
ically, given realizations of the large-scale fading, we find 
the power control coefficients rj m k, m = !,••• ,M,k = 


1, • • • ,K, which maximize the minimum of the rates of all 
users, under the power constraint (7): 


max 

{Vmk} 

subject to 


min iff 

k=l,— ,K 


Efc= 1 Vruklmk <1, m = 1, ..., M 
Vmk> o, k=l,...,K, to = 1,..., M. 


(14) 


Define — Vmk- Then, by introducing the slack variables 
Qk'k and dm, and using (12), we can reformulate (14) as 


max mm 

{Smk,e k 'k^m} k= !,■■■ ,K 


subject to J2k'=l 7 mk'&k' m = 1 ,M 


f r-tM \ ^ 

1 2-^m= 1 imk^mk J 


K M 

E \<Pk'<Pk\ 2 Qk, k + E Pmkd 2 m 

k'^k m —1 


Pd 


Em=l 7 mk'J^Smk' < Qk'k , Vfc' ^ fc 

0 < d m <1, TO = 1, ..., M 

Smk >0, k = 1, ..., K , TO = 1,..., M. 


(15) 


We then have the following result. (The proof is omitted 
due to space constraints.) 

Proposition 1: The optimization problem (15) is quasi¬ 
concave. 

Since the optimization problem (15) is quasi-concave, it can 
be solved efficiently by using bisection and solving a sequence 
of convex feasibility problems [9]. 


V. Numerical Results and Discussions 
We provide some numerical results on Cell-Free Massive 
MIMO performance. The M APs and K users are uniformly 
distributed at random within a square of size 1000 x 1000 m 2 . 
The coefficient f3 m k models the path loss and shadow fading, 
according to 

Pmk = PL m fc • lO^ra'E (16) 

^sh z mk 

where PL m / r represents path loss, and 10 m represents 
shadow fading with standard deviation <r s h, and z rn .k ~ 
A/”(0,1). We use a three-slope model for the path loss [10]: 
the path loss exponent equals 3.5 if the distance between the 
TOth AP and the fcth user (denoted by d m k) is greater than d\, 
equals 2 if d\ > d m k > do, and equals 0 if d mk < do- To 
determine the path loss in absolute numbers, we employ the 
Hata-COST231 propagation model for d m k > d\. 

In all examples, we choose the following parameters: the 
carrier frequency is 1.9 GHz, the AP radiated power is 200 
mW, the noise figure (uplink and downlink) is 9 dB, the AP 
antenna height is 15 m, the user antenna height is 1.65 m, 
cr s h = 8 dB, do = 50, and do = 10 m. To avoid boundary 
effects, and to imitate a network with infinite area, the square 
area is wrapped around. 
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Fig. 1. The minimum per-user rate for different r, without power control. 
Here, M = 200, K = 50, and D = 1 km. 
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Fig. 2. The minimum rates without power control and with max-min power 
control. Here, M = 60, K = 20, and r = 10 and 20. 


A. Pilot Assignment 

We first examine the performance of Cell-Free Massive 
MIMO with different pilot assignment schemes, assuming that 
no power control is performed. Without power control, all 
APs transmit with full power, rjm k = r]mk', Vfc = 1 ,,K. 
Figure 1 shows the cumulative distribution of the minimum 
per-user rate which is given in (12) (the pre-log penalty due 
to the training phase is not accounted here), for M = 200, 
and K = 50, and for different r. The “bound” curve shows 
the case where all K users are assigned orthogonal pilot 
sequences, so there is no pilot contamination. As expected, 
when r increases, the pilot contamination effect reduces, and 
hence, the rate increases. We can see that when r = 20, then 
by using greedy pilot assignment the 95%-likely minimum rate 
can be doubled as compared to when random pilot assignment 
is used. In addition, even when r is very small (r = 5), Cell- 
Free Massive MIMO still provides good service for all users. 
More importantly, the gap between the performance of greedy 
pilot assignment and the bound is not significant. Hence, the 
greedy pilot assignment scheme is fairly good and henceforth 
this is the method we will use. 


1) Small-Cell Systems: To model a small-cell system, we 
assume that each user is served by exactly one AP. For a given 
user, the available AP with the largest average received power 
is selected. Once an AP is chosen by some user, this AP then 
becomes unavailable. The AP selection is done user by user in 
a random order. Mathematically, we let m,k be the AP chosen 
by the fcth user: 

m fc = argmax /3 mfc . 

m£{available APs} 


In small-cell operation, the users estimate the channels. Let 
ip k € C TXl be the pilot sequence transmitted from the m^th 
AP, and let p p be the average transmit power per pilot symbol. 
We assume that the APs transmit with full power. Similarly 
to the derivation of Cell-Free Massive MIMO (details omitted 
due to space constraints), we can derive an achievable rate for 
the fcth user in a small-cell system as: 


RT=E{ log. 


Pd ISmfcfc | 


M 


1 + - K 

Pd{/3m k k — Pm k k) + Pd X/ Pm k ,k + 1| 

\ k'^k ) , 


B. Max-Min Power Control 

In the following, we will examine effectiveness of the max- 
min power control. Fig. 2 shows the the cumulative distribution 
of the achievable rates with max-min and no power control 
respectively, for M = 60, K = 20, and r = 10, 20. We can 
see that with max-min power control, the system performance 
improves significantly. When r = 10, the max-min power 
allocation can improve the 95%-likely rate by a factor of 15 
compared to the case of without power control. In addition, 
by using power control, the effect of pilot contamination can 
be notably reduced. 

C. Cell-Free Massive MIMO versus Small-Cell Systems 

Next we compare the performance of Cell-Free Massive 
MIMO to that of small-cell systems. 


where g mkk ~ CJ\f (0, p mk k) is the channel estimate of g mkk , 

and Pm k k = y/ T PpPm k k/(. T Pt> X)fc' = l Pm k ,k \p k APk' | + !)• 

2) Spatial Shadowing Correlation Models: Transmit¬ 
ters/receivers that are in close vicinity of each other will 
experience correlated shadow fading. Next, we investigate the 
effect of shadowing correlation on both small-cell and Cell- 
Free Massive MIMO systems. 

Most existing correlation models for the shadow fading 
model two correlation effects from the user perspective: cross¬ 
correlation and spatial correlation [11]. The cross-correlation 
effect represents the correlation between the shadowing coeffi¬ 
cients from different base stations to a given user. The spatial 
correlation effect represents the correlation due to the relative 
positions between the users. This model neglects the effects of 
cross-correlation and spatial correlation from the base station 






















Fig. 3. The cumulative distribution of the throughput per user for correlated 
and uncorrelated shadow fading. Here, M = 60, K = 20, and r = 10. 


sides. Doing so is reasonable when the base station antennas 
are located high above the ground, and all base stations are 
well separated. However, in our system models, both the APs 
and the users are located randomly in the network, and they 
may be close to each other. In addition, the AP antenna 
elevation is not very high. Hence, the correlation at the AP 
side should be taken into account as well. We will use the 
following modified correlation model for the shadow fading: 

Zmk = yfpia m + 'Jl-p 1 b k , m = l ,..., M, k = l,...,K, 

where a m ~ Af(0, 1) and bk ~ A/"(0,1) are independent. The 
quantity pi (0 < p\ < 1) represents the cross-correlation at 
the AP side, and 1 — p\ represents the cross-correlation at the 
user side. The spatial correlation is reflected via the correlation 
between a m , m = 1,..., M, and the correlation between bk, 
k = 1,. .., K. We define p 2 ,mm' = E {a m a^,} and p 3ikk > = 
IK {bkb'l ,}. These correlation coefficients depend on the spatial 
locations of the APs and on the users: 

tj a (m,m') d u (k,k') 

P2,mm’ =2 ^decorr , P3,fefe' =2 d decorr , (17) 


where d a (m,m') is the distance between the mth and m'th 
APs, d u (k,k') is the distance between the kth and fc'th users, 
and ddecorr is the decorrelation distance which depends on the 
environment. This model for the correlation coefficients has 
been validated by practical experiments [11]. 

For the comparison between Cell-Free Massive MIMO and 
small-cell systems, we consider the per-user throughput which 
is defined as: 


S k k =B 


1 - r/T 


R 


A 

fcJ 


where A £ {cf, sc} corresponding to Cell-Free Massive 
MIMO and small-cell systems respectively, B is the spectral 
bandwidth, and T in the length of the coherence interval 
in samples. Note that both Cell-Free Massive MIMO and 
small-cell systems need K pilot sequences for the channel 
estimation. There is no difference between Cell-Free Massive 
MIMO and small-cell systems in terms of pilot overhead. 


Furthermore, we assume that both systems use the same set of 
pilot sequences. We choose T = 200 samples, B = 20 MHz, 
rfdecorr = 0.1 km and pi = 0.5. 

Figure 3 compares the cumulative distribution of the 
throughput per user for small-cell and Cell-Free systems, for 
M = 60, K = 20, and r = 10, with and without shadow 
fading correlation. Compared to the small-cell systems, the 
throughput of the Cell-Free systems is much more concen¬ 
trated around its median. Without shadow fading correlation, 
the 95%-likely throughput of the Cell-Free system is about 15 
Mbits/s which is 17 times higher than that of the small-cell 
system (about 0.85 Mbits/s). We can see that the small-cell 
systems are more affected by shadow fading correlation than 
the Cell-Free Massive MIMO systems are. In this example, the 
shadow fading correlation reduces the 95%-likely throughput 
by factors of 4 and 2 for small-cell and Cell-Free systems, 
respectively, compared to the case of uncorrelated shadowing. 

VI. Conclusion 

We analyzed the performance of Cell-Free Massive MIMO 
systems, taking into account the effects of channel estimation. 
We further compared the performance of Cell-Free Massive 
MIMO to that of small-cell systems. 

The results show that Cell-Free systems can significantly 
outperform small-cell systems in terms of throughput. The 
95%-likely per-user throughput of Cell-Free Massive MIMO 
is almost 20 times higher than for a small-cell system. Also, 
Cell-Free Massive MIMO systems are more robust to shadow 
fading correlation than small-cell systems. 
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